Smart Paper Notes

Frouros: A Python library for drift detection in Machine Learning problems

Jaime Céspedes Sisniega , Álvaro López García

Categories: Machine Learning

2022-08-14

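Frouros is a Python library for detecting drift in machine learning problems. It provides a combination of classical and more recent drift detection algorithms: supervised and unsupervised ones, as well as some that can operate in a semi-supervised manner. We designed it to integrate easily with the scikit-learn library by implementing the same application programming interface. The library is developed following a set of best development and continuous integration practices to ensure ease of maintenance and extensibility. The source code is available at https://github.com/ifca/frouros.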
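
As a rough illustration of what a classical streaming drift detector does, the sketch below implements a basic Page-Hinkley test in plain Python. It is not the Frouros API: the class name, the method names, and the `delta` and `threshold` parameters are all assumptions chosen for this example.

```python
class PageHinkley:
    """Minimal Page-Hinkley change detector (illustrative, not the Frouros API)."""

    def __init__(self, delta=0.005, threshold=5.0):
        self.delta = delta          # tolerated magnitude of change (assumed default)
        self.threshold = threshold  # alarm threshold (assumed default)
        self.n = 0                  # number of observations seen
        self.mean = 0.0             # running mean of the stream
        self.cum = 0.0              # cumulative deviation from the running mean
        self.cum_min = 0.0          # smallest cumulative deviation seen so far

    def update(self, x):
        """Feed one observation; return True if an upward drift is flagged."""
        self.n += 1
        self.mean += (x - self.mean) / self.n
        self.cum += x - self.mean - self.delta
        self.cum_min = min(self.cum_min, self.cum)
        return (self.cum - self.cum_min) > self.threshold


def first_drift_index(stream, **kwargs):
    """Return the index of the first flagged observation, or None."""
    detector = PageHinkley(**kwargs)
    for i, x in enumerate(stream):
        if detector.update(x):
            return i
    return None
```

Feeding the detector a per-sample error signal (for example, 0 for a correct prediction and 1 for a wrong one) corresponds to the supervised setting; feeding it a raw feature value corresponds to an unsupervised one.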

Translated by Google Translate

Study of the performance and scalability of federated learning for medical imaging with intermittent clients

Judith Sáinz-Pardo Díaz , Álvaro López García

Categories: Machine Learning | Artificial Intelligence | Computer Vision

2022-07-18

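Federated learning is a privacy-preserving data decentralization technique used to perform machine or deep learning in a secure way. In this paper we present theoretical aspects of federated learning together with a use case in which the number of clients varies. Specifically, a medical image analysis use case is proposed, using chest X-ray images obtained from an open data repository. In addition to the privacy-related advantages, we study the improvement in predictions (in terms of accuracy and area under the curve) and the reduction in execution time with respect to the centralized approach. Different clients are simulated from the training data, selected in an unbalanced manner, i.e., they do not all hold the same amount of data. The results obtained with three and with ten clients are compared with each other and with the centralized case. Two approaches are analyzed for handling intermittent clients, since in a real scenario some clients may leave the training and new ones may join it. The evolution of the results in terms of accuracy, area under the curve, and execution time is shown as the number of clients into which the original data is divided increases. Finally, improvements and future work in the field are proposed.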
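
The abstract does not say how client updates are combined, so the following is only a minimal sketch that assumes a FedAvg-style aggregation: each client's parameter vector is weighted by its number of training samples. The function name and the flat-list representation of model weights are hypothetical.

```python
def federated_average(client_weights, client_sizes):
    """Sample-size-weighted average of client parameter vectors.

    client_weights: one flat list of floats per participating client.
    client_sizes:   number of training samples held by each client.
    Assumption: FedAvg-style weighting; the source does not name the operator.
    """
    if not client_weights or len(client_weights) != len(client_sizes):
        raise ValueError("need one size per client and at least one client")
    total = sum(client_sizes)
    n_params = len(client_weights[0])
    return [
        sum(w[j] * n for w, n in zip(client_weights, client_sizes)) / total
        for j in range(n_params)
    ]
```

Under this scheme an intermittent client is handled by passing only the clients that are active in a given round, so unbalanced and changing participation is reflected in the weights automatically.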

Forecasting COVID-19 spreading through an ensemble of classical and machine learning models: Spain's case study

Ignacio Heredia Cacha , Judith Sainz-Pardo Díaz , María Castrillo Melguizo , Álvaro López García

Categories: Machine Learning | Artificial Intelligence

2022-07-12

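In this work we evaluate an ensemble of population models and machine learning models for predicting the near-future evolution of the COVID-19 pandemic, with a particular use case in Spain. We rely solely on open and public datasets, fusing incidence, vaccination, human mobility, and weather data to feed our machine learning models (Random Forest, Gradient Boosting, k-Nearest Neighbors, and Kernel Ridge Regression). We use the incidence data to fit classical population models (Gompertz, Logistic, Richards, Bertalanffy) so that they better capture the trend of the data. We then ensemble these two families of models to obtain a more robust and accurate prediction. Furthermore, we observe that the predictions obtained with the machine learning models improve as we add new features (vaccines, mobility, climatic conditions), and we analyze the importance of each feature using Shapley Additive Explanation values. As in any other modeling work, both data and prediction quality have several limitations, so the results must be viewed critically, as we discuss in the text. Our work concludes that the ensemble of these models improves on the individual predictions (using only machine learning models or only population models) and can be applied, with caution, in cases where compartmental models cannot be used for lack of relevant data.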
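
To make the two ingredients concrete, the sketch below shows a Gompertz growth curve in its common three-parameter form and a plain per-step average over several models' forecasts. The parameterization and the use of an unweighted mean are assumptions; the abstract does not state how the paper combines the models.

```python
import math


def gompertz(t, a, b, c):
    """Gompertz curve a * exp(-b * exp(-c * t)).

    a: asymptote (final size), b: displacement, c: growth rate.
    """
    return a * math.exp(-b * math.exp(-c * t))


def ensemble_mean(predictions):
    """Average forecasts per time step.

    predictions: list of equally long forecast lists, one per model.
    Assumption: unweighted mean; the paper's aggregation rule may differ.
    """
    return [sum(step) / len(step) for step in zip(*predictions)]
```

For example, `ensemble_mean` could be given one forecast from a fitted population model and one from a machine learning regressor over the same horizon.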

Vehicle Trajectory Prediction on Highways Using Bird Eye View Representations and Deep Learning

Rubén Izquierdo , Álvaro Quintanar , David Fernández Llorca , Iván García Daza , Noelia Hernández , Ignacio Parra , Miguel Ángel Sotelo

Categories: Computer Vision | Artificial Intelligence

2022-07-04

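This work presents a novel method for predicting vehicle trajectories in highway scenarios using efficient bird's-eye-view representations and convolutional neural networks. With basic visual representations, vehicle positions, motion histories, road configuration, and vehicle interactions are easily included in the prediction model. The U-Net model was selected as the prediction kernel to generate future visual representations of the scene using an image-to-image regression approach. A method was implemented to extract vehicle positions from the generated graphical representations with subpixel resolution. The method was trained and evaluated on the PREVENTION dataset, an on-board sensor dataset. Different network configurations and scene representations were evaluated. The study found that a U-Net with 6 depth levels, a linear terminal layer, and a Gaussian representation of the vehicles is the best-performing configuration. The use of lane markings was found not to improve prediction performance. The average prediction error is 0.47 and 0.38 meters, and the final prediction error is 0.76 and 0.53 meters, for the longitudinal and lateral coordinates respectively, for a predicted trajectory length of 2.0 seconds. The prediction error is up to 50% lower than that of the baseline method.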
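
The abstract does not describe how subpixel positions are recovered from the generated images. One common way to do it, shown here purely as an assumption, is an intensity-weighted centroid over the heatmap patch that contains a single vehicle blob.

```python
def subpixel_centroid(heatmap):
    """Intensity-weighted centroid (x, y) of a 2D heatmap patch.

    heatmap: list of rows of non-negative floats containing one blob.
    Assumption: centroid extraction; the paper's actual method may differ.
    """
    total = sum_x = sum_y = 0.0
    for y, row in enumerate(heatmap):
        for x, value in enumerate(row):
            total += value
            sum_x += x * value
            sum_y += y * value
    if total == 0:
        raise ValueError("heatmap contains no signal")
    return sum_x / total, sum_y / total
```

Because the centroid is a weighted average of pixel coordinates, it can fall between pixel centers, which is what gives the position finer-than-pixel resolution.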

Segmentation of Multiple Myeloma Plasma Cells in Microscopy Images with Noisy Labels

Álvaro García Faura , Dejan Štepec , Tomaž Martinčič , Danijel Skočaj

Categories: Computer Vision

2021-11-08

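A key component of improved and faster cancer diagnosis is the development of computer-assisted tools. In this paper, we present the solution that won the SegPC-2021 competition for the segmentation of multiple myeloma plasma cells in microscopy images. The labels in the competition dataset were generated semi-automatically and are noisy. To deal with this, a heavy image augmentation procedure was carried out, and predictions from several models were combined using a custom ensemble strategy. Using state-of-the-art feature extractors and instance segmentation architectures, the solution achieved a mean Intersection-over-Union of 0.9389 on the SegPC-2021 final test set.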
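
For readers unfamiliar with the reported metric, the sketch below computes Intersection-over-Union for a pair of binary masks. It is a generic definition, not the competition's evaluation script; the function name and the convention of returning 1.0 for two empty masks are assumptions.

```python
def iou(pred, target):
    """Intersection-over-Union of two equally sized binary masks.

    pred, target: lists of rows with truthy values marking foreground.
    Returns 1.0 when both masks are empty (assumed convention).
    """
    intersection = union = 0
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            intersection += 1 if (p and t) else 0
            union += 1 if (p or t) else 0
    return intersection / union if union else 1.0
```

A mean IoU is then the average of this value over the evaluated images or instances.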

Countering Malicious Content Moderation Evasion in Online Social Networks: Simulation and Detection of Word Camouflage

Álvaro Huertas-García , Alejandro Martín , Javier Huertas Tato , David Camacho

Categories: Natural Language Processing | Artificial Intelligence

2022-12-27

Content moderation is the process of screening and monitoring user-generated content online. It plays a crucial role in stopping content resulting from unacceptable behaviors such as hate speech, harassment, violence against specific groups, terrorism, racism, xenophobia, homophobia, or misogyny, to mention a few, on online social platforms. These platforms use a plethora of tools to detect and manage malicious information; however, malicious actors also improve their skills, developing strategies to get past these barriers and continue spreading misleading information. Twisting and camouflaging keywords are among the most widely used techniques for evading platform content moderation systems. In response to this ongoing issue, this paper presents an innovative approach to this linguistic trend in social networks through the simulation of different content evasion techniques and a multilingual Transformer model for content evasion detection. In this way, we share with the scientific community a multilingual public tool, named "pyleetspeak", that generates and simulates content evasion in a customizable way through automatic word camouflage, together with a multilingual Transformer-based Named-Entity Recognition (NER) model tuned to recognize and detect it. The multilingual NER model is evaluated in different textual scenarios, detecting different types and mixtures of camouflage techniques and achieving an overall weighted F1 score of 0.8795. This article contributes significantly to countering malicious information by developing multilingual tools to simulate and detect new methods of content evasion on social networks, making the fight against information disorders more effective.
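
A toy version of word camouflage can be sketched with a fixed character substitution table. This does not use or reproduce the pyleetspeak API: the mapping, the function name, and the `prob` and `seed` parameters are hypothetical and chosen only to illustrate the idea.

```python
import random

# Hypothetical substitution table; real camouflage uses many more variants.
LEET_MAP = {"a": "4", "e": "3", "i": "1", "o": "0", "s": "5"}


def camouflage(word, prob=1.0, seed=None):
    """Replace mapped characters with look-alike digits.

    prob: chance of substituting each eligible character.
    seed: optional seed for reproducible output.
    """
    rng = random.Random(seed)
    return "".join(
        LEET_MAP[ch.lower()]
        if ch.lower() in LEET_MAP and rng.random() < prob
        else ch
        for ch in word
    )
```

Text generated this way, paired with the positions of the altered words, is the kind of labeled data a token-level detector such as an NER model could be trained on.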

Spacecraft Pose Estimation Based on Unsupervised Domain Adaptation and on a 3D-Guided Loss Combination

Juan Ignacio Bravo Pérez-Villar , Álvaro García-Martín , Jesús Bescós

Categories: Computer Vision

2022-12-27

Spacecraft pose estimation is a key task for enabling space missions in which two spacecraft must navigate around each other. Current state-of-the-art algorithms for pose estimation employ data-driven techniques. However, there is an absence of real training data for spacecraft imaged in space conditions due to the costs and difficulties associated with the space environment. This has motivated the introduction of 3D data simulators, which solve the issue of data availability but introduce a large gap between the training (source) and test (target) domains. We explore a method that incorporates 3D structure into the spacecraft pose estimation pipeline to provide robustness to intensity domain shift, and we present an algorithm for unsupervised domain adaptation with robust pseudo-labelling. Our solution ranked second in the two categories of the 2021 Pose Estimation Challenge organised by the European Space Agency and Stanford University, achieving the lowest average error over the two categories.

Automated Gadget Discovery in Science

Lea M. Trenkwalder , Andrea López Incera , Hendrik Poulsen Nautrup , Fulvio Flamini , Hans J. Briegel

Categories: Artificial Intelligence | Machine Learning

2022-12-24

In recent years, reinforcement learning (RL) has become increasingly successful in its application to science and the process of scientific discovery in general. However, while RL algorithms learn to solve increasingly complex problems, interpreting the solutions they provide becomes ever more challenging. In this work, we gain insights into an RL agent's learned behavior through a post-hoc analysis based on sequence mining and clustering. Specifically, frequent and compact subroutines, used by the agent to solve a given task, are distilled as gadgets and then grouped by various metrics. This process of gadget discovery develops in three stages: First, we use an RL agent to generate data, then, we employ a mining algorithm to extract gadgets and finally, the obtained gadgets are grouped by a density-based clustering algorithm. We demonstrate our method by applying it to two quantum-inspired RL environments. First, we consider simulated quantum optics experiments for the design of high-dimensional multipartite entangled states where the algorithm finds gadgets that correspond to modern interferometer setups. Second, we consider a circuit-based quantum computing environment where the algorithm discovers various gadgets for quantum information processing, such as quantum teleportation. This approach for analyzing the policy of a learned agent is agent and environment agnostic and can yield interesting insights into any agent's policy.
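
The mining stage can be illustrated with a deliberately simple stand-in: counting contiguous action subsequences of a fixed length and keeping those that appear in enough episodes. The abstract does not name the mining algorithm, so restricting to contiguous n-grams, as well as the function and parameter names, is an assumption.

```python
from collections import Counter


def frequent_subsequences(sequences, length, min_support):
    """Contiguous subsequences of a given length found in many episodes.

    sequences:   list of action sequences (one per episode).
    min_support: minimum number of episodes that must contain a candidate.
    Assumption: contiguous n-grams only, a simplification of sequence mining.
    """
    counts = Counter()
    for seq in sequences:
        # A set ensures each episode counts a candidate at most once.
        seen = {
            tuple(seq[i:i + length])
            for i in range(len(seq) - length + 1)
        }
        counts.update(seen)
    return {gadget: c for gadget, c in counts.items() if c >= min_support}
```

The surviving candidates would then be passed on to the clustering stage, which this sketch does not cover.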

Learning efficient backprojections across cortical hierarchies in real time

Kevin Max , Laura Kriener , Garibaldi Pineda García , Thomas Nowotny , Walter Senn , Mihai A. Petrovici

Categories: Machine Learning | Neural and Evolutionary Computing

2022-12-20

Models of sensory processing and learning in the cortex need to efficiently assign credit to synapses in all areas. In deep learning, a known solution is error backpropagation, which, however, requires biologically implausible weight transport from feed-forward to feedback paths. We introduce Phaseless Alignment Learning (PAL), a bio-plausible method to learn efficient feedback weights in layered cortical hierarchies. This is achieved by exploiting the noise naturally found in biophysical systems as an additional carrier of information. In our dynamical system, all weights are learned simultaneously with always-on plasticity and using only information locally available to the synapses. Our method is completely phase-free (no forward and backward passes or phased learning) and allows for efficient error propagation across multi-layer cortical hierarchies, while maintaining biologically plausible signal transport and learning. Our method is applicable to a wide class of models and improves on previously known biologically plausible ways of credit assignment: compared to random synaptic feedback, it can solve complex tasks with fewer neurons and learn more useful latent representations. We demonstrate this on various classification tasks using a cortical microcircuit model with prospective coding.

Improving Depression estimation from facial videos with face alignment, training optimization and scheduling

Manuel Lage Cañellas , Constantino Álvarez Casado , Le Nguyen , Miguel Bordallo López

Categories: Computer Vision | Artificial Intelligence

2022-12-13

Deep learning models have shown promising results in recognizing depressive states using video-based facial expressions. While successful models typically leverage 3D-CNNs or video distillation techniques, the varying use of pretraining, data augmentation, preprocessing, and optimization techniques across experiments makes fair architectural comparisons difficult. We propose instead to enhance two simple models based on ResNet-50 that use only static spatial information, by applying two specific face alignment methods and improved data augmentation, optimization, and scheduling techniques. Our extensive experiments on benchmark datasets obtain results similar to those of sophisticated spatio-temporal models for single streams, while the score-level fusion of two different streams outperforms state-of-the-art methods. Our findings suggest that specific modifications to the preprocessing and training process result in noticeable differences in model performance and could obscure the differences originally attributed to the use of different neural network architectures.
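
Score-level fusion, as opposed to feature-level fusion, combines the final predictions of separately trained models. A minimal sketch is a weighted average of the two streams' per-video scores; the abstract does not give the fusion rule, so the weighting scheme and all names below are assumptions.

```python
def fuse_scores(scores_a, scores_b, weight_a=0.5):
    """Weighted average of per-sample scores from two model streams.

    weight_a: weight of the first stream; the second gets 1 - weight_a.
    Assumption: linear fusion; the paper's fusion rule may differ.
    """
    if len(scores_a) != len(scores_b):
        raise ValueError("both streams must score the same samples")
    return [
        weight_a * a + (1.0 - weight_a) * b
        for a, b in zip(scores_a, scores_b)
    ]
```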
